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Abstract 


We  show  that  a  type  system  based  on  the  intuitionistic  modal  logic  S4  provides  an  expressive 
framework  for  specifying  and  analyzing  computation  stages  in  the  context  of  functional  languages. 
Our  main  technical  result  is  a  conservative  embedding  of  Nielson  fe  Nielson’s  two-level  functional 
language  in  our  language  Mini-ML°,  which  in  addition  to  partial  evaluation  also  supports  multiple 
computation  stages,  sharing  of  code  across  multiple  stages,  and  run-time  code  generation. 
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1  Introduction 


Dividing  a  computation  into  separate  stages  is  a  common  informal  technique  in  the  derivation  of 
algorithms.  For  example,  instead  of  matching  a  string  against  a  regular  expression  we  may  first 
compile  a  regular  expression  into  a  finite  automaton  and  then  execute  the  automaton  on  a  given 
string.  Partial  evaluation  divides  the  computation  into  two  stages  based  on  the  early  availability 
of  some  function  arguments.  Binding-time  analysis  determines  which  part  of  the  computation  may 
be  carried  out  in  a  first  (static)  phase,  and  which  part  remains  to  be  done  in  a  second  (dynamic) 
phase. 

It  often  takes  considerable  ingenuity  to  write  programs  in  such  a  way  that  they  exhibit  proper 
binding-time  separation,  that  is,  that  all  computation  pertaining  to  the  statically  available  argu¬ 
ments  can  in  fact  be  carried  out.  From  a  programmer’s  point  of  view  it  is  therefore  desirable  to 
declare  the  expected  binding-time  separation  and  obtain  constructive  feedback  when  the  computa¬ 
tion  may  not  be  staged  as  expected.  This  suggests  that  the  binding-time  properties  of  a  function 
should  be  expressed  in  its  types  in  a  prescriptive  type  system,  and  that  binding-time  analysis  should 
be  a  form  of  type  checking.  The  work  on  two-level  functional  languages  [NN92]  and  some  work  on 
partial  evaluation  [GJ91,  Hen91])  shows  that  this  view  is  indeed  possible  and  fruitful. 

Up  to  now  these  type  systems  have  been  motivated  algorithmically^  that  is,  they  are  explicitly 
designed  to  support  partial  evaluation.  In  this  paper  we  show  that  they  can  also  be  motivated 
logically^  and  that  the  proper  logical  system  for  expressing  computation  stages  is  the  intuitionistic 
variant  of  the  modal  logic  S4.  This  observation  immediately  gives  rise  to  a  natural  generalization 
of  standard  binding-time  analysis  by  allowing  multiple  computation  stages,  sharing  of  code  across 
multiple  stages,  and  communication  of  binding-time  information  across  module  boundaries  via 
types. 

One  of  our  conclusions  is  that  when  we  extend  the  Curry-Howard  isomorphism  between  proofs 
and  programs  from  intuitionistic  logic  to  the  intuitionistic  modal  logic  S4  we  obtain  a  natural  and 
logical  explanation  of  computation  stages.  Each  world  in  the  Kripke  semantics  of  modal  logic 
corresponds  to  a  stage  in  the  computation.  A  term  of  type  BA  corresponds  to  code  to  be  executed 
in  the  current  or  a  future  stage  of  the  computation.  The  modal  restrictions  imposed  on  a  type  of 
the  form  BA  guarantee  that  a  function  of  type  B  BA  can  carry  out  all  computation  concerned 
with  its  first  argument  while  generating  the  residual  code  of  type  A. 

The  starting  points  for  our  investigation  are  the  systems  for  the  intuitionistic  modal  logic  S4 
in  [BdP92,  PW95]  and  the  two-level  A-calculus  in  [NN92].  We  augment  the  former  with  recursion 
to  obtain  Mini-ML°  and  then  show  that  a  two-level  functional  language  may  be  fully  and  faithfully 
embedded  in  Mini-ML°.  This  verifies  that  Mini-ML°  is  indeed  a  conservative  extension  of  the 
two-level  language  of  [NN92]  and  thus  correctly  expresses  standard  binding-time  separation.  Fol¬ 
lowing  [PW95],  we  also  sketch  a  compilation  from  Mini-ML°  to  a  related  language  Mini-ML°  whose 
operational  semantics  embodies  the  separation  of  evaluation  into  multiple  stages.  The  language 
Mini-ML°  contains  constructs  similar  to  the  Lisp  backquote,  comma  and  eval,  allowing  an  existing 
program  to  be  easily  divided  into  stages,  while  the  language  Mini-ML°  expresses  the  staging  in  a 
form  that  may  be  more  directly  executed. 
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MODAL  ANALYSIS  OF  STAGED  COMPUTATION 


2  Modal  Mini-ML:  An  Explicit  Formulation 

This  section  presents  Mini-ML°,  a  language  that  combines  some  elements  of  Mini-ML  [CDDK86] 
with  a  modal  A-calculus  for  intuitionistic  S4,  [BdP92,  PW95].  The  presentation  of  the  modal 

constructs  differs  from  A^^^  in  that  we  have  a  let  form  for  de-constructing  boxed  values,  and  use 
two  contexts  in  the  typing  rules.  This  avoids  the  need  for  syntactic  substitutions,  but  does  not 
alter  the  essential  properties  of  the  system. 

For  the  sake  of  simplicity,  we  make  the  language  explicitly  typed,  since  we  do  not  treat  type 
inference  here.  We  have  also  chosen  not  to  include  polymorphism,  because  there  are  issues  regarding 
the  interaction  between  type  variables  and  computation  stages  that  would  distract  from  the  main 
point  of  this  paper. 


2.1  Syntax 


Types  A 
Terms  E 


Contexts  P 


nst  I  Ai  — y  A2  I  A\  x  A2  |  DA 
X  I  Xx:A,  E  \  E1E2 

I  fix  x:A.  E  I  (El,  £’2)  I  fst  E  I  snd  E 
I  z  I  s  E  I  (case  Ei  of  z  E2  I  s  a;  E3) 
I  box  E  I  let  box  x  =  Eiin  E2 
•  \T,x:A 


We  use  A,  B  for  types,  P,  A  for  contexts,  and  x  for  variables  assuming  that  any  variable  can 
be  declared  at  most  once  in  a  context.  Bound  variables  may  be  renamed  tacitly.  We  omit  leading 
•’s  from  contexts.  We  write  [E'/a:]E  for  the  result  of  substituting  E'  for  x  in  E,  renaming  bound 
variables  as  necessary  in  order  to  avoid  the  capture  of  free  variables  in  E'.  The  addition  of  types  DA 
to  Mini-ML  introduces  two  new  term  constructs:  box  E  for  introduction  and  let  box  a:  =  Ei  in  E2 
for  elimination. 


2.2  Typing  Rules 

Our  typing  rules  for  the  Mini-ML  fragment  of  the  explicit  language  are  completely  standard.  The 
problem  of  typing  the  modal  fragment  is  well  understood;  we  present  here  a  variant  of  known 
systems  [BdP92,  PW95]  inspired  by  zonal  formulations  of  linear  logic  such  as  Girard’s  LU  [Gir93]. 
In  our  formulation  the  typing  judgment  has  two  contexts.  The  first  contains  variables  that  may 
appear  anywhere,  since  they  represent  code.  The  second  contains  variables  that  are  only  available 
in  the  current  computation  stage,  including  all  ordinary  Mini-ML  variables.  Thus  our  judgement 

A;PKE:  A 

would  correspond  to  □  A,  P  h  E*  :  A  in  A^°,  where  E*  is  an  appropriate  obvious  translation  of  E. 
Note  that  our  system  has  the  property  that  a  valid  term  has  a  unique  typing  derivation. 
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A-calcuIus  Fragment. 


x:A  in  P 
A;rP  a;  :  A 


tpeJvar 


A-T,x:Ah^  E  :  B 

- tpeJam 

A;rP  Aa;:A.  E:A-^5 


A-TFEiiA^B  AiPPFarA 

A;rP  EiEi-.B 


tpe-app 


Mini-ML  Fragment. 


A; r, x:A\^  E:  A 

- tpeJix 

A;  r  P  fix  a;:A.  F  :  A 


A;  r  P  :  Ai  A;  P  P  £^2  :  A2 
A;rP(i?i,F2):AixA2 


tpe_pair 


A;  r  P  :  Ai  X  ^2 

- tpe.fst 

A;rpfst  jK:  Ai 


- tpe_z 

A;  r  P  z  :  nat 


A;rP^;:  Ai  X  A2 

- tpe_snd 

A;  r  P  snd  E  :  A2 


A;rP.E:nat 

- tpe_s 

A;  r  P  s  :  nat 


A;rp£^i:nat  A;  TP  ^2:  A  A;  T,  ^:nat  P  ^3  :  A 
- tpe_case 

A;  r  P  (case  of  z  £"2  I  s  a:  =>  £3)  :  A 


Modal  Fragment. 


A;  •  P  £  :  A 

- tpe^box 

A;rPbox£:nA 


— - tpe_gvar 

A;rPx  :  A 

A;  r  P  £1  :  DA  A,  ic:A;  T  P  £2  :  £ 

- tpe_let_box 

A;  r  P  let  box  a:  —  £1  in  £2  :  £ 


Note  that  the  rule  tpe^box  does  not  allow  variables  bound  in  the  second  context  to  appear  in 
the  body  of  a  box  constructor,  and  only  the  rule  tpe_let_box  binds  variables  in  the  first  context. 
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2.3  Operational  Semantics 

The  Mini-ML  fragment  of  our  system  has  a  standard  operational  semantics.  For  the  modal  part, 
we  interpret  box  E  as  a  value  containing  the  frozen  computation  E  which  may  be  carried  out  in 
a  future  stage.  We  evaluate  let  box  x  =  Ei'in  E2  as  a  substitution  of  the  residual  code  generated 
by  El  for  x  in  E2  and  then  evaluating  E2.  The  residual  code  for  Ei  will  then  be  evaluated  during 
the  evaluation  of  E2  as  necessary. 

Note  that  if  E:A  and  E^V  then  V'.A  and  V  is  unique.  Mini-ML  has  this  property,  which  is 
easy  to  establish  by  induction  over  the  structure  of  an  evaluation.  Also  note  that  we  have  omitted 
types  in  terms  from  the  rules  below,  since  they  are  irrelevant  here. 

Values  V  ::=  \x.  E  \  (Vi,  ^2)  I  z  I  s  V  |  box  E. 


A-calcuIus  Fragment. 


- evJam 

\x,  E  ^ ^  \x^  E 


El  Xx,  El 


E2  ^  V2  [V2/x]E[  ^  F 

El  E2^V 


Mini-ML  Fragment. 


[fix  X.  E/x]E  V 

- - ev -fix 

fix  X.  E  ^  V 


El  Vi  E2^  V2 

(Ei,E2}^(Vi,V2} 


ev_pair 


E^{VuV2) 

fst  F  M-  Ft 


ev_fst 


E^{Vi,V2) 
snd  E  ^V2 


ev_snd 


- ev_z 

Z  Z 


E^V 

- ev_s 

s  E  ^  sV 


El  ^  z  E2^V 

— — - ev_case_z 

(case  i?!  of  z  £^2  I  s  x  ^  £3)  V 

El  s  VI  [Vl/x]Es  ^  V 
— - ev.-case_s 


(case  £1  of  z  =>  £2  I  s  x  £3)  ^  V 
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Modal  Fragment. 

El  ^  box  E[  [E[/x]E2  ^  V2 

- ev_box  - ev_let_box 

box  E  ^  box  E  let  box  x  =  Ei  in  E2  ^  V2 

Note  that  in  the  evaluation  of  well-typed  terms,  only  terms  inside  a  box  constructor  are  ever 
substituted  into  another  box  constructor. 

2.4  Example:  The  Power  Function  in  Explicit  Form 

We  now  show  how  we  can  define  the  power  function  in  Mini-ML°  in  such  a  way  that  has  type 
nat  — )•  □(nat  nat),  assuming  a  closed  term  times\nat  nat  nat  (definable  in  the  Mini-ML 
fragment  in  the  standard  way). 

power  =  fix  p:nat  □(nat  — >  nat). 

Anrnat.  case  n 

of  z  ^  box  (Aa;:nat.  sz) 

I  s  7n=^  letbox  q  =  p  min  box  (Aa;:nat.  times  x  (qx)) 

The  type  nat  □(nat  nat)  expresses  the  that  function  evaluates  everything  that  depends  on  the 
first  argument  of  type  nat  (the  exponent)  and  return  residual  code  of  type  □(nat  nat).  Indeed, 
we  calculate  with  our  operational  semantics: 

power  zM-  box  (Aa::nat.  s  z) 
power  (s  z)^  box  (Xxinst.  times  x  ((Aa::nat.  s  z)x)) 
power  (s  (s  z))^  box  (Aa::nat.  times  x  ((Ax:nat.  times  x  ((Aa::nat.  s  z)x))x)) 

Modulo  some  trivial  redices  of  variables  for  variables,  this  is  the  result  we  would  expect  of  partial 
evaluation. 

2.5  Implementation  Issues 

The  operational  semantics  of  Mini-ML°  may  be  implemented  by  a  translation  into  pure  Mini-ML, 
mapping  □A  to  unit  A;  box  E  to  Ati:unit.  E\  and  let  box  x  Ex  in  E2  to  (Ax^unit 
A.  [x\)/x]E2)Ei,  It  may  then  appear  that  the  modal  fragment  of  Mini-ML°  is  redundant.  Note, 
however,  that  the  type  unit  A  does  not  express  any  binding  time  properties,  while  □A  does.  It  is 
precisely  this  distinction  which  makes  Mini-ML°  interesting:  the  type  checker  will  reject  programs 
which  may  execute  correctly,  but  for  which  the  desired  binding-time  separation  is  violated.  Without 
the  modal  operator,  this  property  cannot  be  expressed  and  consequently  not  checked. 

Another  implementation  method  would  be  to  interpret  □A  as  a  data-type  representing  code 
that  calculates  a  value  of  type  A.  This  code  could  be  either  machine  code,  source  code,  or  some 
intermediate  language.  This  would  allow  optimization  after  specialization,  and  could  also  support 
an  operation  to  output  code  as  a  separate  program.  The  representation  must  support  substitution 
of  one  code  fragment  into  another,  as  required  by  the  evJet_box  rule.  If  the  code  is  machine  code, 
this  naturally  leads  to  the  idea  of  templates,  as  used  in  run-time  code  generation  (see  [KEH93]). 
The  deferred  compilation  approach  in  [LL94]  would  provide  a  more  sophisticated  implementation, 
supporting  fast  run-time  generation  of  optimized  code. 
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3  Modal  Mini-ML:  An  Implicit  Formulation 

We  now  define  an  implicit  version  Mini-ML°  of  the  explicit  Mini-ML°,  following  [PW95]  where 
an  implicit  system  was  defined.  This  system  is  more  reasonable  as  a  programming  language, 
since  we  do  not  have  to  explicitly  stage  computation  as  required  with  let  box  x  =  E\  in  E2.  The 
operational  semantics  of  the  new  system  is  given  in  terms  of  a  type-preserving  compilation  to  the 
explicit  system.  Our  development  differs  from  [PW95]  in  that  we  introduce  a  term  constructor  pop. 
This  means  that  typing  derivations  for  valid  terms  are  unique  and  the  compilation  from  implicit 
to  explicit  terms  is  deterministic,  avoiding  some  unpleasant  problems  concerning  coherence. 


3.1  Syntax 


Types  A 

Terms  M 


Contexts  P 

Context  Stacks 


nat  I  Ai  — ^  A2  I  At  X  A2  |  OA 
X  I  Xx:A.  M  I  Ml  M2 

I  fix  x:A.  M  I  (Ml,  M2)  I  fst  M  I  snd  M 
I  z  I  s  M  I  (case  Mi  of  z  =^>  M2  I  s  a;  M3) 
I  box  M  I  unbox  M  |  pop  M 
■  |r,a;:A 


All  the  categories,  except  context  stacks  are  standard.  The  importance  of  context  stacks  will 
be  apparent  when  we  present  the  typing  rules. 


3.2  Typing  Rules 

In  this  section  we  present  typing  rules  for  Mini-ML°  using  context  stacks.  The  typing  judgment 
has  the  form 

r  P  M  :  A  term  M  has  type  A  in  local  context  F  under  stack 

The  context  stack  enables  the  distinguished  use  of  variables  depending  on  their  relative  position 
with  respect  to  the  box  operators  that  enclose  the  term  being  typed.  Intuitively,  each  element  F 
of  the  context  stack  ’F  corresponds  to  a  computation  stage.  The  variables  declared  in  F  are  the 
ones  whose  values  will  be  available  during  the  corresponding  evaluation  phase.  When  we  encounter 
a  term  box  M  we  enter  a  new  evaluation  stage,  since  M  will  be  frozen  during  evaluation.  In  this 
new  phase,  we  are  not  allowed  to  refer  to  variables  of  the  prior  phases,  since  they  may  not  be 
available  when  M  is  unfrozen  (“unboxed”).  Thus,  variables  may  only  be  looked  up  in  the  current 
(innermost)  context  (rule  tpi_var)  which  is  initialized  as  empty  when  we  enter  the  scope  of  a  box 
(rule  tpi_box).  However,  code  generated  in  the  current  or  earlier  stages  may  be  used,  which  is 
represented  by  the  rules  tpi_unbox  and  tpi.pop. 
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A-calcuIus  Fragment. 

a;:Ainr  .  (F,  a;:A)  P  M  :  B 

- tpi-var  - tpiJam 

'I';rPa;:A  ^  Xx:A.  M  :  A B 

«^;rPM:A^B  «^;rPA:A 

- tpi_app 

«';rPMiV:S 


Mini-ML  Fragment. 

IF;  r,  x:A  P  M  :  A 

- - tpiJix 

^;rPfix3;:A.  M:  A 


W;  r  P  Ml  :  Ai  P  P  M2  :  A2  .  . 

- tpi-pair 

$;rP(Mi,M2):AixA2 


'F;  r  P  M  ;  Ai  X  A2 
- tpi-fst 

’F;  r  P  fst  M  :  Ai 


- : - tpi_z 

r  P  z  :  nat 


’F;  r  P  M  :  Ai  X  A2 
- tpi_snd 

r  P  snd  M  :  A2 

r  P  M  :  nat 
- tpi_s 

r  P  sM  :  nat 


T-iTPMiinat  'F;  TP  M2:  A  5^;  p,  a;:nat  P  M3  :  A  . 
- ; - tpi_case 

r  P  (case  Mi  of  z  M2  I  s  a;  M3)  :  A 


Modal  Fragment. 

T';r;-PM:A 

- tpi_box 

'F;rPbox  M:aA 


^;rPM:nA 

- tpi_unbox 

r  P  unbox  M  :  A 


'F;APM:DA 
- ^ - tpi_pop 

’F;  A;  P  P  pop  M  :  nA 

Note  that  it  may  be  useful  to  consider  the  modal  fragment  of  the  implicit  language  to  be  a 
statically  typed  analogue  to  the  quoting  mechanism  in  Lisp.  Then  box  corresponds  to  backquote 
and  unbox  (pop  ■)  to  comma,  unbox  alone  corresponds  to  eval,  while  pop  alone  corresponds 
to  quoting  an  expression  generated  with  comma.  Note  however  that  our  implementation  via  a 
compilation  to  Mini-ML°  is  quite  different  from  Lisp  quoting. 
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3.3  Examples  in  Implicit  Form 

We  now  show  how  we  can  define  the  power  function  in  Mini-ML°  in  a  simpler  form  than  in 
Mini-MLg  ,  though  still  with  type  nat  -)•  □(nat  nat).  We  use  unbox,  M  as  syntactic  sugar  for 
unbox  (pop®  M). 

power  =  fix  p:nat  — f  □(nat  — >•  nat). 

An:nat.  case  n 

of  z  =4”  box  (Axinat.  s  z) 

I  s  m=^  box  (Aa;:nat.  times  x  (unboxi  {p  m)x)) 

As  another  example,  we  show  how  to  define  a  function  of  type  nat  -)■  Gnat  that  returns  a 
box  ’ed  copy  of  its  argument: 

liftj^2it  =  /:nat  ->  Gnat.  Aa;:nat.  case  x  of 

z  =J>  box  z 

I  s  x'  box  (s  (unboxi  (/  a;'))) 

A  similar  term  of  type  A  — )•  □  A  that  returns  a  box  ’ed  copy  of  its  argument  exists  exactly  when 
A  is  observable,  i.e.,  contains  no  This  justifies  the  inclusion  of  the  lift  primitive  in  two-level 
languages  such  as  in  [GJ91],  and  in  fact  in  a  more  realistic  version  of  our  language  it  could  also  be 
included  as  a  primitive. 

3.4  Translation  to  Explicit  Language 

We  do  not  define  an  operational  semantics  for  Mini-ML°  directly;  instead  we  depend  upon  a 
translation  to  Mini-ML°.  This  translation  recursively  extracts  terms  inside  a  pop  constructor  and 
binds  the  result  of  their  evaluation  to  new  variables,  bound  with  a  let  box  outside  the  enclosing 
box  constructor.  Variables  thus  bound  occur  exactly  once. 

The  compilation  from  implicit  to  explicit  terms  is  perhaps  most  easily  understood  if  we  restrict 
pop  to  occur  only  immediately  underneath  an  unbox  or  another  pop.  On  the  pure  fragment 
terms  then  follow  the  grammar 

Terms  M  ::=  x  \  Aa;:A.  M  \  Mi  M2 
I  box  M  1  unbox  P 
Pops  P  ::=  M  I  pop  P 

The  extension  to  the  full  language  including  recursion  is  tedious  but  trivial.  Any  term  can  be 
transformed  to  one  satisfying  our  restriction  by  replacing  isolated  occurrences  of  pop  M  by 

box  (unbox  (pop  (pop  M))). 

The  compilation  below  keeps  track  of  the  context  in  which  the  term  to  be  translated  should  be 
placed  (the  k  argument).  This  is  necessary  so  that  when  we  encounter  an  pop  operator  we  can 
find  the  matching  box  operator  and  insert  a  let  box  binding  in  the  resulting  explicit  term.  We 
use  the  notation  k  =  Ah.  E  for  a  context  k  with  hole  h.  Filling  the  hole  is  written  as  an  application 
k{E').  This  must  be  implemented  as  syntactic  replacement  since  k  is  intended  to  capture  variables 
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free  in  E' .  First,  the  translation  on  terms,  [M]  k. 


Ixjk 

[Ml  M2I  k  = 
[A*.  M]  k  = 
[box  M|  k  = 
[unbox  Pjk  = 


k{x) 

[MiKAhi.  [M2](Ah2.  k{h,  h^))) 
[M](A/i.  k{\x.  h)) 

[M](A/i.  A; (box  h)) 

[P]  k  {Ah.  h) 


Nested  pop  operators  are  translated  by  traversing  the  current  context  k  from  the  inside  out  until 
a  box  operator  is  found.  This  cancels  one  pop  operator  and  continues  the  translation.  After 
all  pop  operators  have  been  removed  (possibly  none),  we  introduce  a  let  box  and  continue 
the  translation.  The  b  argument  accumulates  the  body  of  the  let  box  which  will  eventually  be 
introduced. 


[pop 

P] 

{Ah. 

k{Eih))b 

li 

0 

{Ah.  Eib{h)) 

[pop 

P]  {Ah. 

k{hE2))b 

=  [pop  P]  k 

{Ah.  b{h)  E2) 

[pop 

P] 

{Ah. 

k{\x.h))b 

=  [pop  P]  k 

{Ah.  \x.  b{h)) 

[pop 

P] 

{Ah. 

A; (box  h))b 

=  [F]A:(A/i. 

box  b{h)) 

[pop 

P] 

{Ah. 

A;  (let  box  x  = 

=  h  in  E2))  b 

[pop  P]  k 

{Ah,  let  box  x 

=  b{h)  in  E2) 

[pop 

P] 

{Ah. 

A;  (let  box  x  - 

=  El  in  h))b 

= 

[pop  P]  k 

{Ah,  let  box  x 

=  El  in  b{h)) 

[M]kb  =  [M]  {Ah.  fc(let  box  y  =  hin  b{y)))  where  y  is  new 

Since  h  must  occur  exactly  once  in  Ah.  E,  the  cases  for  [pop  P]kb  leave  out  only  Ah.  h.  If 
the  original  term  is  well-typed  this  case  can  never  arise.  An  important  invariant  of  [P]  kb  is  that 
Ah.  k{b{h))  remains  the  same  in  every  recursive  call.  At  present  we  have  not  formally  proven 
that  the  translation  above  maps  well- typed  explicit  terms  to  well-typed  implicit  terms.  A  related, 
slightly  more  complicated  translation  has  been  proven  correct  in  [PW95]. 

As  an  example  of  this  translation,  it  maps  the  above  definition  of  power  to  the  previous  explicit 
one. 

It  is  important  to  note  that  the  operational  semantics  induced  by  the  translation  is  very  different 
from  the  natural  one  defined  directly  on  Mini-ML°.  In  [MM94]  a  simple  reduction  semantics  for  a 
system  similar  to  our  implicit  system  is  introduced  which  does  not  reflect  binding  time  separation 
in  any  way.  It  is  instead  used  to  prove  a  Church-Rosser  theorem  and  strong  normalization  for  a 
pure  modal  A-calculus. 


4  A  Two- level  Language 

In  this  section  we  define  Mini-ML2,  a  two-level  functional  language  very  close  to  the  one  described 
in  [NN92].  We  then  define  a  simple  translation  into  Mini-ML°  and  prove  that  binding-time  cor¬ 
rectness  in  Mini-ML2  is  equivalent  to  modal  correctness  of  the  translation  in  Mini-ML°. 

Our  language  differs  slightly  from  [NN92]  in  that  we  inject  all  run-time  types  into  compile-time 
types,  instead  of  just  function  types.  This  follows  [GJ91],  where  there  is  no  such  restriction.  Also, 
we  find  it  convenient  to  divide  the  variables  and  contexts  into  run-time  and  compile-time,  which 
involves  a  small  change  in  the  “up”  and  “down”  rules.  All  other  differences  to  [NN92]  are  due  to 
minor  differences  between  their  underlying  language  and  Mini-ML. 
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4.1  Syntax 


Run-time  Types  r 

ri:±T2  1  T1XT2 

Compile-time  Types  a 

nat 

<7i=F(T2  I  O'!  X  (72  I  r 

Terms  e 

X 

Xx:t.  e  1  ei@e2 

fix  x:t.  e  I  (  ei,  62  )  I  fst  e  I  snd  e 
z  1  s  e  1  fcase  ei  of  z  =>  e?.  1  s  x  ^ 

\y 

Xy:cr.  e  |  ei@e2 

fix  y :cr.  e  |  (  ei ,  €2  )  j  fst  e  |  snd  e 
z  1  s  e  1  (case  ei  of  z  62  I  s  y  63) 

Run-time  Contexts  F 

11= 

T,^.t 

Compile-time  Contexts  A 

A,y:cr 

4.2  Typing  Rules 
Run-time  Typing 

x:t  in  P 

- tpr.var 

A;  r  F  X  :  r 

A;  r,  s:r2  F  e  :  r  A;  T  F  ei  :  to—^t  A;  P  F  62  :  r2 

- tprJam  - tpr_app 

A;  P  F  A«:r2.  e  :  T2^t  A;  P  F  ei®e2  :  r 


A;  P,  x:t  F  e  :  r 

- - tpr  Jix 

A;  P  F  fix  ^:r.  e  :  r 

A;  P  F  €1  :  n  A;  P  F  62  :  T2 
A;P  F  (  ei,e2  )  :  tiXT2 


tpr_pair 


A;  P  F  e  :  T1XT2 
- tpr_fst 

A;PFMe  :n 


A;  P  F  e  : 

- tpr_snd 

A;  P  F  snd  e  :  T2 


- tpr_z 

A;  P  F  z  :  nat 


A;  P  F  e  :  nat 

- ^tpr_s 

A;  P  F  se  : 


A;PFei:nat  A;PFe2:T  A;  P,  x  :  nat  F  63  :  r 
- tpr_case 

A;  P  F  (case  ei  of  z  €2  I  s  «  63)  :  r 


A  F  e  :  r  , 
- down 


A;PF  e  :  r 
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Compile-time  Typing 


y:a  in  A 


tpc-var 


A,  y:a2  e  \  a 


tpc_lam 


A  U  y  :  (7 

A  P  ei  :  a2=Ta 


A  K  62  :  (T2 


tpc.app 


A  F  Ay:a2.  e  :  a2^cr  A  6i@e2  :  (t 

A,y:a  P  6  :  a 

-  tpcJix 


A  F  fix  y:(7.  e  :  a 
A  U  6i  :  ai  A  U  62  :  cr2 


A  P  (  61,62  )  :  criXcr2 


tpc^pair 


A  K  6  :  (Ti  X  (72 
- ^ - tpC-fst 

A  P  fst  6  :  (Ji 


■  tpc_z 


A  F  6  :  (Ji  X  (72 


A  K  snd  6  :  (72 


tpc^snd 


A  K  6  :  nat 


A  F  z  :  nat  A  F  S6  :  nat 

A  F  61  :  nat  A  P  62  :  (7  A,  y  :  P  63  :  a 

A  P  (case  61  of  z  =>  62  \  sy  e^)  >  <J 

A; .  U  6  :  r 


tpc_s 


tpc>.case 


A  h"  6  :  r 


up 


Note  that  we  remove  run-time  assumptions  at  the  down  rule,  while  in  [NN92]  this  is  done  later 
at  the  up  rule.  This  change  is  justified  since  by  the  structure  of  their  rules,  such  assumptions  can 
never  be  used  in  the  compile-time  deduction  in  between. 

4.3  Translation  to  Implicit  Modal  Mini-ML 

The  translation  to  Mini-ML°  is  now  very  simple.  We  translate  both  run-time  and  compile-time 
Mini-ML  fragments  directly,  and  insert  □,  box  ,  unbox  and  pop  to  represent  the  changes  between 
phases.  We  define  two  mutually  recursive  functions  to  do  this:  ||  ■  ||  is  the  run-time  translation  and 
I  •  1  is  the  compile-time  translation.  We  overload  this  notation  between  types  and  terms.  We  write 
6  and  6  to  match  any  term  whose  top  constructor  matches  the  phase  annotation. 

Run-time  Types 

||nat||  =  nat 

lki^'7-2|j  =  Ikill”^  1^211 
I|rixr2||  =  ||ri||x||r2|| 
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Compile-time  Types 


Run-time  Terms 


lease  ei  of  z  : 


Compile-time  Terms 


|nat| 

= 

nat 

kil  ->  k2l 

ki'xaaj 

kil  X  k2| 

|rl 

— 

alkll 

INI 

X 

11^:7-.  e|l 

Az:l|r||.  ||e|| 

||ei@e2|| 

Ikill  Ileall 

||fix  x:t.  e\\ 

fix  ic:||rl|.  ||e|| 

ll(.ei,C2l|| 

= 

(IhlMleall) 

IIM  ell 

— 

fst  ||e|| 

||snd  ej 

snd  ||e|| 

M 

z 

llsell 

slNI 

1  s  2.  eall 

= 

case  ||ei||  of  z 

ii'ii 

— 

unbox  (pop  |e 

Isl 

_ 

y 

\Xy:T\  e\ 

Xy:\T\.  le| 

_|ei@e2j 

kil  leal 

|flx  y:T.  e\ 

fix  y:|T|.  |e| 

\{euj2)\ 

=: 

(|eiMe2|) 

[fst  e| 

= 

fst  \e\ 

|snd  e| 

= 

snd  |e| 

|zi 

= 

z 

|sei 

s  \e\ 

■2  1  s  y  esi 

=: 

case  |ei|  of  z 

ki 

— 

box  ||e|| 

leal  \  sy=^  |e3| 


4.4  Equivalence  of  Binding  Time  Correctness  and  Modal  Correctness 

In  this  section  we  state  our  main  theorem,  which  is  that  binding  time  correctness  is  equivalent 
to  modal  correctness  of  the  translation  to  Mini-ML°.  We  write  V  ::  (J)  if  X>  is  a  derivation  of 
judgment  J. 

Theorem  1 

1.  If  ||e||  =  M  then: 

(a)  ifVr  ::  (A;r  F  e  :  r)  then  we  have  T>i  ::  (|A|;  ||r||  H  M  :  ||r||); 

(b)  if  Vi  ::  (|A|;  ||r||  P  M  :  A)  then  we  have  Vr  "  (1A|;  ||r||  F  e  :  r)  with  ||r||  =  A. 
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2.  If  \e\  =  M  then: 

(a)  ifVc  ::  (A  P  e  :  a)  then  we  have  Vj  ::  (|A|  P  M  :  |(t|); 

(b)  if  Vi  ::  (|A|  M  :  A)  then  we  have  "  (A  P  e  :  a)  with  |ct|  =  A. 

Proof:  By  simultaneous  induction  on  the  definitions  of  ||e||  and  |e|.  Note  that  we  can  take  advan¬ 
tage  of  strong  inversion  properties,  since  we  have  exactly  one  typing  rule  for  each  term  constructor 
in  Mini-ML°  and  Mini-ML2,  plus  the  up  and  down  rules  to  connect  the  P  and  K  judgements. 

We  only  show  the  two  cases  involving  both  definitions,  since  all  others  are  easy.  Note  that  at 
the  variables  we  need  to  rely  on  the  phase  annotations. 


Case:  ||e||  =  unbox  (pop  |e|).  To  show  part  la  we  note  that  by  inversion  we  have 

K 

P  _  APe:r 

- down 

A;rK  e  :r 

then  applying  part  2a  of  the  induction  hypothesis  to  P'  to  get  V'i  we  get 

. 

|A|  P  |e|  :  Dllrll 

^  ^ - tpi_pop 

*  |A|;||r||  P  pop  |e|  :  □||r||  _ 

- - tpi_unbox 

|A|;  ||r||  P  unbox  (pop  |e|)  :  ||r|| 


Now,  to  show  part  i 6  we  note  that  we  can  reverse  the  roles  of  the  inversion  and  proof 
construction  above,  and  use  part  2b  of  the  induction  hypothesis. 

Case:  |e|  =  box  ||€||  To  show  part  2a  we  note  that  by  inversion  we  have 


VI 

Vc=  A;-Ke:r 
^  - up 

A  K  e  :  r 


then  applying  part  la  of  the  induction  hypothesis  to  D'  to  get  V-  we  get 


Vi  = 


V'i 

|A|;-P  ||e||  :  ||r|| 


|Al  P  box  ||e||  :  0\\t 


t  pi -box 


Now,  to  show  part  2b  we  note  that  again  we  can  reverse  the  roles  of  the  inversion  and  proof 
construction  and  use  part  lb  of  the  induction  hypothesis. 


□ 
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By  examining  this  proof  we  can  verify  that  the  translation  of  a  two-level  term  can  always 
be  type-checked  only  using  the  tpi.unbox  and  tpi_pop  rules  when  tpi_unbox  immediately  follows 
tpi_pop.  This  corresponds  to  a  weaker  modal  logic,  K,  in  which  we  drop  the  assumption  in  S4  that 
the  accessibility  relation  is  reflexive  and  transitive  [MM94]. 

In  fact,  we  can  define  a  language  Mini-ML°'  by  replacing  the  unbox  and  pop  constructors 
with  one  equivalent  to  unboxi  as  in  [MM94].  Then,  Mini-ML^  closely  models  Mini-MLa,  but 
permits  an  arbitrary  number  of  phases,  each  of  which  can  only  execute  the  code  generated  by 
the  immediately ,  preceding  one.  This  is  similar  to  the  idea  of  5-level  languages  in  [NN92]  (with 
5  linearly  ordered),  and  in  fact  a  5-level  version  of  Mini-ML  would  be  exactly  equivalent  to 
Mini-ML^,  by  a  natural  extension  of  the  two-level  translation.  It  is  also  similar  to  the  Multi-level 
Generating  Extensions  of  [GJ95]. 

It  is  interesting  then  to  consider  what  the  reflexitivity  and  transitivity  assumptions  model  in 
the  context  of  staged  computation.  Essentially  they  allow  us  to  execute  generated  code  at  any 
future  time,  or  immediately.  It  would  be  difficult  to  achieve  the  same  in  an  extension  of  a  two-level 
language,  since  the  separation  between  the  levels  is  achieved  by  duplicating  the  term  and  type 
constructors.  Hence  we  consider  Mini-ML°  to  be  an  appropriate  language  in  which  to  study  more 
general  forms  of  staged  computation,  including  run-time  code  generation. 


5  Examples 

We  now  present  some  standard  examples  from  partial  evaluation  to  illustrate  the  expressiveness 
of  our  language  Mini-ML°.  We  use  let  x  =  Ei  ’va.  to  introduce  (non-polymorphic)  top-level 
definitions;  it  may  be  considered  as  syntactic  sugar  for  {Xx\A.  E2)  E\. 

5.1  Ackermann’s  Function 

We  now  present  a  program  for  calculating  Ackermann’s  function  that  specializes  to  the  first  argu¬ 
ment.  It  is  based  on  the  following  program; 

let  ackermann  =  fix  acker :nat  nat  — )•  nat. 

Am:nat.  case  m 

of  z  ^  An:nat.  sn 
I  s  Tn'=>  Aninat.  case  n 

of  z  ^  acker  m'  (s  z) 

I  s  n'=^  {acker  m'  {acker  m  n')) 

in  ... 

Now,  if  we  attempt  to  directly  insert  the  modal  constructors  to  divide  this  program  into  two 
stages,  we  get  the  following: 

let  ackermann  =  fix  ackerinat  □(nat  nat). 

Am:nat.  case  m 

of  z  box  (An:nat.  sn) 

I  s  m'=^  box  (An:nat.  case  n 

of  z  (unboxi  {acker  m'))  (s  z) 

I  s  n'=^  (unboxi  {acker  m'))  ((unboxi  {acker  m))  n')) 

in  . . . 
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Unfortunately,  when  applied  to  the  first  argument,  this  function  generally  won’t  terminate. 
This  is  a  common  problem  in  partial  evaluation,  and  the  usual  solution  is  employ  memoization 
during  specialization,  which  works  for  many  programs.  Here  we  will  simply  note  that  the  problem 
in  this  case  is  a  recursive  call  to  acker  m  while  calculating  acker  m,  which  can  be  removed  by 
adding  an  additional  fix  as  follows. 

let  ackermann  =  fix  acfcer:nat  □(nat  nat). 

Am:nat.  case  m 

of  z  =>  box  (Anrnat.  sn) 

I  s  m':=>  box  (fix  ackm.  An:nat. 
case  n 

of  z  (unboxi  (acker  m'))  (s  z) 

I  s  (unboxi  (acker  m'))  (ackm  n')) 

in  ... 

This  function  will  always  terminate.  The  recursive  applications  appearing  inside  uiibox_l  con¬ 
structors  are  evaluated  when  the  first  argument  is  given.  The  compilation  of  this  function  to 
Mini-MLg  makes  this  more  explicit: 

let  ackermann  =  fix  acfcer :nat  — >•  □(nat  — )•  nat). 

Am:nat.  case  m 

of  z  =>  box  (Anrnat.  sn) 

I  s  let  box  /  —  acker  m'  in 
let  box  g  —  acker  m'  in 
box  (fix  ackm.  Anrnat. 
case  n 

of  z  /  (s  z) 

I  s  n'=>  g  (ackm  n')) 

in  . . . 

Notice  that  acker  m’  is  unnecessarily  calculated  twice.  This  could  be  avoided  if  memoization 
was  employed  during  the  compilation. 

5.2  Inner  Products 

In  [GJ95]  the  calculation  of  inner  products  is  given  as  an  example  of  a  program  with  more  than  two 
phases.  We  now  show  how  this  example  can  be  coded  in  Mini-ML°.  Note  that  we  have  assumed 
a  data  type  vector  in  the  example,  along  with  a  function  sub:mt  vector  nat  to  access  the 
elements  of  a  vector. 

Then,  the  inner  product  example  without  staging  is  expressed  in  Mini-ML  as  follows: 

let  iprod  ~  fix  zprnat  vector  vector  nat. 

Anrnat.  case  n 

of  z  Aurvector.  Au;:vector.  z 

I  s  Aurvector.  Awrvector. 

plus  (times  (sub  n  u)  (sub  n  u;))  (ip  n'  v  w) 


in . . . 
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We  add  in  □,  box  and  unbox,  to  get  a  function  with  three  computation  stages.  We  assume  a 
function  liftnat  ^  defined  earlier  and  a  function  sub':nat  □(vector  nat)  which  is  a  specializing 
version  of  sw6,  that  perhaps  pre-computes  some  pointer  arithmetic  based  on  the  array  index.  We 
first  define  a  staged  version  times'  of  times  which  avoids  the  multiplication  in  the  specialization  if 
the  first  argument  is  zero.  This  will  speed  up  application  of  iprod'  to  its  third  argument,  particularly 
in  the  case  that  the  second  argument  is  a  sparse  vector. 

let  times' □(nat  — )•  nat))  = 
box  (Am:nat.  case  m 

of  z  =>  box  (An:nat.  z) 

I  s  m'^  box  (An:nat.  times  n  (unboxi  (/(ftpat  ’^)))) 
iprod'  =  fix  ip-.nat  □(vector  □(vector  nat)). 

An:nat.  case  n 

of  z  box  (Auivector.  box  (Atuwector.  z)) 

I  s  n'=^  box  (Auwector.  box  (Atuivector. 

plus  (unboxi  (unboxi  fimes'(unboxi  {sub'  n)  u)) 

(unbox2  {sub'  n)  lu)) 

(unboxi  (unboxi  {ip  n')  v)  w))) 
iprodS  :  □(vector  □(vector  — ^  nat))  =  iprod'  3 
iprodSa  :  □(vector  nat)  =  unbox  iprodS  [7, 0, 9] 

iprodSb  :  □(vector  nat)  =  unbox  iprodS  [7, 8,  0] 

The  last  four  lines  show  how  to  execute  the  result  of  a  specialization  using  unbox  without  pop 
(corresponding  to  eval  in  Lisp).  Also,  the  occurrence  of  unbox2  indicates  code  used  at  the  third 
stage  but  generated  at  the  first.  These  two  aspects  could  not  be  expressed  within  a  multi-level 
language. 

Note  the  erasure  of  the  unbox,  and  box  constructors  in  iprod'  leaves  iprod,  except  that  we 
used  a  different  version  of  multiplication.  The  operational  semantics  of  the  two  programs  is  of 
course  quite  different. 

5.3  Regular  Expression  Matching 

We  now  present  a  program  for  regular  expression  matching  that  specializes  to  a  particular  regular 
expression.  We  use  the  full  Standard  ML  language,  augmented  with  our  modal  constructors.  Our 
program  is  based  on  the  following  non-specializing  one,  which  makes  use  of  a  continuation  function 
that  is  called  with  the  remaining  input  if  the  current  matching  succeeds. 

datatype  regexp 
=  Empty 

I  Plus  of  regexp  *  regexp 
I  Times  of  regexp  *  regexp 
1  Star  of  regexp 
I  Const  of  string 

(*  val  accept’  :  regexp  ->  (string  list  ->  bool)  ->  (string  list  ->  bool)  *) 


in  let 


in  let 
in  let 
in  let 
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fun  accept’  (Empty)  k  s  =  k  s 

I  accept’  (Plus(rl,r2))  k  s  =  accept’  rl  k  s  orelse  accept’  r2  k  s 

I  accept’  (Times (rl,r2))  k  s  =  accept’  rl  (fn  s’  =>  accept’  r2  k  s’)  s 

I  accept’  (Star(r))  k  s  = 
k  s  orelse 

accept’  r  (fn  s’  =>  if  s  =  s’  then  false  else  accept’  (Star(r))  k  s’)  s 
I  accept’  (Const(str))  k  (x::s)  =  (x  =  str)  andalso  k  s 
I  accept’  (Const (str))  k  (nil)  =  false 

(♦  val  accept  :  regexp  ->  (string  list  ->  bool)  *) 

fun  accept  r  s  =  accept’  r  (fn  nil  =>  true  |  (x::l)  =>  false)  s 


Note  that  there  is  a  recursive  call  to  accept’  Star(r)  in  the  case  for  accept’  Star(r)  which 
we  can  transform  using  a  local  definition,  similar  to  the  fix  introduced  in  the  Ackermann  function 
example.  This  must  be  done  so  that  specialization  with  respect  to  the  regular  expression  terminates. 
The  resulting  code  for  this  case  is: 

I  accept’  (Star(r))  k  s  = 
let  fun  acc’  k  s  = 

k  s  orelse 

accept’  r  (fn  s’  =>  if  s  =  s’  then  false  else  acc’  k  s’)  s 
in 

acc’  k  s 

end 

Then,  we  can  add  in  modal  constructors  to  get  our  staged  program  with  the  following  types 
(using  $  here  to  represent  □) 

val  accept2’  :  regexp  ->  $( (string  list  ->  bool)  ->  (string  list  ->  bool)) 
val  accept2  :  regexp  ->  $ (string  list  ->  bool) 

These  types  indicate  that  the  required  staging  is  achieved  by  the  program.  Inserting  the  modal  con¬ 
structors  requires  breaking  up  the  function  arguments,  but  is  otherwise  relatively  straightforward. 
We  use  unbox^l  for  unboxi  —  unbox  (pop  •). 


fun  accept2’ 

1  accept2’ 
box 


I  accept2’ 
box 

I  accept2’ 
box 


(Empty)  =  box  (fn  k  =>  fn  s  =>  k  s) 

(Plus(rl,r2))  = 

(fn  k  ->  fn  s  => 

(unbox_l  (accept2’  rl))  k  s 
orelse  (unbox.l  (accept2’  r2))  k  s) 

(Times (rl,r2))  = 

(fn  k  =>  fn  s  => 

(unbox.l  (accept2’  rl))  (fn  s’  =>  (unbox.l  (accept2’  r2))  k  s’)  s) 
(Star(r))  = 

(fn  k  =>  fn  s  => 
let  fun  acc2’  k  s  = 
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k  s  orelse 

((unbox_l  (accept2’  r)) 

(fn  s’  =>  if  s  =  s’  then  false  else  acc2’  k  s’)  s) 
in 

acc’  k  s 
end) 

I  accept2’  (Const (str))  = 

box  (fn  k  =>  (fn  s  => 

case  s  of  (x: :s’)  =>  (x  =  lift_string  str)  andalso  k  s’ 

I  nil  =>  false)) 

fun  accept2  r  =  box  (fn  s  =>  (unbox_l  (accept2’  r)) 

(fn  s’  =>  case  s’  of  nil  =>  true  |  (x::l)  =>  false) 
s) 

We  can  now  use  our  compilation  to  the  explicit  language  Mini"ML°  to  get  an  equivalently 
staged  program  without  the  modal  operators.  We  can  then  further  translate  to  a  program  in 
pure  Standard  ML,  which  is  staged  in  the  same  way,  but  without  the  modal  annotations.  It  is 
uneccessary  to  replace  $A  by  unit  ->  A  in  this  case,  since  box  is  only  applied  to  values.  We  show 
this  program  only  to  demonstrate  the  staging  described  by  the  the  modal  annotated  program.  The 
program  in  Mini-ML°  has  the  potential  to  be  more  efficient,  since  optimized  code  can  be  generated 
by  a  sophisticated  implementation. 

(*  val  accepts’  :  regexp  ->  (string  bool)  ->  (string  ->  bool)  +) 

fun  accepts’  (Empty)  =  (fn  k  =>  fn  s  =>  k  s) 

I  accepts’  (Plus(rl,r2))  = 
let  val  al  =  accepts’  rl 

val  a2  =  accepts’  r2 

in 

(fn  k  =>  fn  s  =>  al  k  s  orelse  a2  k  s) 

end 

I  accepts’  (Times (rl,r2))  = 
let  val  al  =  accepts’  rl 

val  a2  =  accepts’  r2 

in 

(fn  k  =>  fn  s  =>  al  (fn  s’  =>  a2  k  s’)  s) 

end 

I  accepts’  (Star(rl))  = 
let  val  al  =  accepts’  rl 

fun  acc2  k  s  =  k  s  orelse 

al  (fn  s’  =>  if  s  =  s’  then  false  else  acc2  k  s’)  s 
in 

(fn  k  =>  fn  s  =>  acc2  k  s) 

end 

I  accepts’  (Const (str))  = 
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(fn  k  =>  (fn  (x::s)  =>  (x  =  str)  andalso  k  s 
I  nil  =>  false)) 

(*  val  accepts  :  regexp  ->  (string  ->  bool)  *) 

fun  accepts  r  =  accepts’  r  (fn  nil  =>  true  I  (x::l)  =>  false) 

6  Conclusion  and  Future  Work 

In  this  paper  we  have  proposed  a  logical  interpretation  of  binding  times  and  staged  computation 
in  terms  of  the  intuitionistic  modal  logic  S4.  We  first  presented  an  explicit  language  Mini-ML° 
(including  recursion,  natural  numbers,  and  pairs)  and  its  natural  operational  semantics.  This 
language  is  too  verbose  to  be  practical,  so  we  continued  by  defining  an  implicit  language  Mini-ML° 
which,  with  some  syntactic  sugar,  might  serve  as  the  core  for  an  extension  of  a  language  with  the 
complexity  of  Standard  ML.  The  operational  semantics  of  Mini-ML°  is  given  by  a  compilation 
to  Mini-ML°.  It  generalizes  Nielson  &  Nielson’s  two-level  functional  language  [NN92]  which  is 
demonstrated  by  a  conservative  embedding  theorem,  the  main  technical  result  of  this  paper. 

The  two-level  language  we  consider,  Mini-ML2,  is  directly  based  on  the  one  in  [NN92],  but  has 
a  stricter  binding-time  correctness  criterion  than  used,  for  example,  in  [GJ91].  Essentially,  this 
restriction  may  be  traced  to  the  fact  that  our  underlying  evaluation  model  applies  only  to  closed 
terms,  while  [GJ91]  seems  to  require  evaluation  of  terms  with  free  variables.  Gluck  and  Jprgensen 
[GJ95]  present  a  multi-level  binding- time  analysis  with  the  less  strict  binding-time  correctness 
criterion,  along  with  practical  motivations  for  multi-level  partial  evaluation,  though  they  do  not 
treat  higher  order  functions.  A  modal  operator  similar  to  the  “next”  operator  from  temporal  logic 
looks  promising  as  a  candidate  to  model  this  looser  correctness  criterion,  but  we  have  yet  to  develop 
this  line  of  research. 

Our  language  Mini-ML°  requires  the  insertion  of  the  box,  unbox  and  pop  coercions  into  a 
functional  program.  It  may  be  preferable  for  these  coercions  to  remain  implicit,  though  in  such  a 
language  valid  expressions  no  longer  have  unique  or  even  principal  types,  thus  raising  coherence 
problems.  We  intend  to  study  a  language  in  which  the  modal  types  are  considered  refinements  of 
the  usual  Mini-ML  types,  using  intersections  to  express  principal  types  (see  [FP91]  for  analogous 
non-modal  refinement  types).  Refinement  type  inference  for  this  language  would  be  a  form  of 
generalized,  polyvariant  binding-time  analysis.  Compilation  would  be  type-directed,  generating 
different  versions  of  functions  appropriate  for  different  stagings  of  computation.  The  programmer 
would  control  this  process  through  refinement  type  constraints  imposed  upon  functions  by  type 
annotations.  Type  inference  in  such  a  language  would  need  to  depend  strongly  on  subtyping  via 
implicit  coercions  between  refinement  types. 

Our  operational  semantics  is  also  rather  naive  from  a  partial  evaluation  point  of  view.  In 
particular,  we  do  not  memoize  during  specialization.  A  memoizing  semantics  would  be  desirable 
for  a  serious  implementation,  and  would  require  some  restrictions  on  side-effects.  See  [BW93]  for 
a  description  of  a  serious  partial  evaluator  for  Standard  ML,  which  in  part  inspired  this  work. 

This  paper  does  not  treat  polymorphism,  though  it  seems  that  it  should  not  cause  any  problems. 
We  expect  our  type  system  to  interact  very  well  with  ML’s  module  system.  In  fact,  part  of 
our  motivation  was  to  provide  the  programmer  with  means  to  specify  staging  (=  binding  time) 
information  in  a  signature  and  thus  propagate  it  beyond  module  boundaries. 
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Our  approach  provides  a  general  logically  motivated  framework  for  staged  computation  that 
includes  aspects  of  both  partial  evaluation  and  run-time  code  generation.  As  such  it  should  allow 
efficient  code  to  be  generated  within  a  more  declarative  style  of  programming,  and  provides  an 
automatic  check  that  the  intended  staging  is  achieved.  We  have  implemented  a  simple  version  of 
Mini-ML°  in  the  logic  programming  language  Elf  [Pfe91].  To  date  we  have  only  experimented  with 
small  examples,  but  plan  to  carry  out  more  realistic  experiments  in  the  near  future. 
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